Auditing hierarchical cycles to locate other inconsistencies in the UMLS.
نویسندگان
چکیده
A cycle in the parent relationship hierarchy of the UMLS is a configuration that effectively makes some concept(s) an ancestor of itself. Such a structural inconsistency can easily be found automatically. A previous strategy for disconnecting cycles is to break them with the deletion of one or more parent relationships-irrespective of the correctness of the deleted relationships. A methodology is introduced for auditing of cycles that seeks to discover and delete erroneous relationships only. Cycles involving three concepts are the primary consideration. Hypotheses about the high probability of locating an erroneous parent relationship in a cycle are proposed and confirmed with statistical confidence and lend credence to the auditing approach. A cycle may serve as an indicator of other non-structural inconsistencies that are otherwise difficult to detect automatically. An extensive auditing example shows how a cycle can indicate further inconsistencies.
منابع مشابه
Sculpting the UMLS Refined Semantic Network
BACKGROUND The Refined Semantic Network (RSN) for the UMLS was previously introduced to complement the UMLS Semantic Network (SN). The RSN partitions the UMLS Metathesaurus (META) into disjoint groups of concepts. Each such group is semantically uniform. However, the RSN was initially an order of magnitude larger than the SN, which is undesirable since to be useful, a semantic network should be...
متن کاملStructural group-based auditing of missing hierarchical relationships in UMLS
The Metathesaurus of the UMLS was created by integrating various source terminologies. The inter-concept relationships were either integrated into the UMLS from the source terminologies or specially generated. Due to the extensive size and inherent complexity of the Metathesaurus, the accidental omission of some hierarchical relationships was inevitable. We present a recursive procedure which a...
متن کاملAuditing the NCI Thesaurus with Semantic Web Technologies
Auditing biomedical terminologies often results in the identification of inconsistencies and thus helps to improve their quality. In this paper, we present a method based on Semantic Web technologies for auditing biomedical terminologies and apply it to the NCI thesaurus. We stored the NCI thesaurus concepts and their properties in an RDF triple store. By querying this store, we assessed the co...
متن کاملAnalyzing polysemous concepts from a clinical perspective: Application to auditing concept categorization in the UMLS
OBJECTIVES Polysemy is a frequent issue in biomedical terminologies. In the Unified Medical Language System (UMLS), polysemous terms are either represented as several independent concepts, or clustered into a single, multiply-categorized concept. The objective of this study is to analyze polysemous concepts in the UMLS through their categorization and hierarchical relations for auditing purpose...
متن کاملApproaches to Eliminating Cycles in the UMLS Metathesaurus: Naïve vs. Formal
Applications exploiting the hierarchical relations recorded in the Unified Medical Language System (UMLS) Metathesaurus suffer from the presence of inconsistencies in these relations. A formal approach to identifying and eliminating circular hierarchical relations has been proposed in previous work, leading to the creation of a directed acyclic Metathesaurus graph. However, this approach is at ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- AMIA ... Annual Symposium proceedings. AMIA Symposium
دوره 2011 شماره
صفحات -
تاریخ انتشار 2011